Confidence intervals for policy evaluation in adaptive experiments
نویسندگان
چکیده
Significance Randomized controlled trials are central to the scientific process, but they can be costly. For example, a clinical trial may assign patients treatments that detrimental them. Adaptive experimental designs, such as multiarmed bandit algorithms, reduce costs by increasing probability of assigning promising over course experiment. However, because observations collected these methods dependent and their distribution is nonstationary, statistical inference challenging. We propose treatment-effect estimator has an asymptotically unbiased normal test statistic under straightforward, relatively weak conditions on adaptive design. This generalizes for variety parameters interest.
منابع مشابه
Bootstrapping with Models: Confidence Intervals for Off-Policy Evaluation
In many reinforcement learning applications executing a poor policy may be costly or even dangerous. Thus, it is desirable to determine confidence interval lower bounds on the performance of any given policy without executing said policy. Current methods for high confidence off-policy evaluation require a substantial amount of data to achieve a tight lower bound, while existing model-based meth...
متن کاملModel-averaged confidence intervals for factorial experiments
We consider the coverage rate of model-averaged confidence intervals for the treatment means in a factorial experiment, whenwe use a normal linearmodel in the analysis.Modelaveraging provides a useful compromise between using the full model (containing all main effects and interactions) and a ‘‘best model’’ obtained by some model-selection process. Use of the full model guarantees perfect cover...
متن کاملConfidence Intervals for Information Retrieval Evaluation
Information retrieval results are currently limited to the publication in which they exist. Significance tests are used to remove the dependence of the evaluation on the query sample, but the findings cannot be transferred to other systems not involved in the test. Confidence intervals for the population parameters provide query independent results and give insight to how each system is expecte...
متن کاملAdaptive Markov Chain Monte Carlo Confidence Intervals
In Adaptive Markov Chain Monte Carlo (AMCMC) simulation, classical estimators of asymptotic variances are inconsistent in general. In this work we establish that despite this inconsistency, confidence interval procedures based on these estimators remain consistent. We study two classes of confidence intervals, one based on the standard Gaussian limit theory, and the class of so-called fixed-b c...
متن کاملAdaptive Confidence Intervals for the Test Error in Classification
Sample Size n = 30 n = 100 n = 250 Data Set / Method ACI Yang Jiang ACI Yang Jiang ACI Yang Jiang ThreePt .976 .893* .914* .961 .552* .945 .961 .387* .930* Magic .955 .999* .983* .977* .991* .969* .972* .997* .974* Mam. .957 .989* .966 .962 .996* .964 .960 .995* .968 Ion. .947 .995* .985* .948 .996* .970* .970 .990* .970* Donut .968 .966 .908* .969 .851* .971* .973* .898* .966 Bal. .979* .996* ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the National Academy of Sciences of the United States of America
سال: 2021
ISSN: ['1091-6490', '0027-8424']
DOI: https://doi.org/10.1073/pnas.2014602118